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1 Introduction 

Quantum computing is a new computational paradigm created by reformulating in- 
formation and computation in a quantum mechanical framework |30, 27|. Since the 
laws of physics appear to be quantum mechanical, this is the most relevant frame- 
work to consider when considering the fundamental limitations of information pro- 
cessing. Furthermore, in recent decades we have seen a major shift from just observ- 
ing quantum phenomena to actually controlling quantum mechanical systems. We 
have seen the communication of quantum information over long distances, the "tele- 
portation" of quantum information, and the encoding and manipulation of quantum 
information in many different physical media. We still appear to be a long way from 
the implementation of a large-scale quantum computer, however it is a serious goal 
of many of the world's leading physicists, and progress continues at a fast pace. 

In parallel with the broad and aggressive program to control quantum mechan- 
ical systems with increased precision, and to control and interact a larger number 
of subsystems, researchers have also been aggressively pushing the boundaries of 
what useful tasks one could perform with quantum mechanical devices. These in- 
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elude improved metrology, quantum communication and cryptography, and the im- 
plementation of large-scale quantum algorithms. 

It was known very early on |27 1 that quantum algorithms cannot compute func- 
tions that are not computable by classical computers, however they might be able to 
efficiently compute functions that are not efficiently computable on a classical com- 
puter Or, at the very least, quantum algorithms might be able to provide some sort 
of speed-up over the best possible or best known classical algorithms for a specific 
problem. 

The purpose of this paper is to survey the field of quantum algorithms, which has 
grown tremendously since Shor's breakthrough algorithms |57 58 1 over 15 years 
ago. Much of the work in quantum algorithms is now textbook material (e.g. 1541 
[39ll45ll43ll49l ), and we will only briefly mention these examples in order to provide 
a broad overview. Other parts of this survey, in particular, sections [4] and [5] give a 
more detailed description of some more recent work. 

We organized this survey according to an underlying tool or approach taken, and 
include some basic applications and specific examples, and relevant comparisons 
with classical algorithms. 

In section |2] we begin with algorithms one can naturally consider to be based 
on a quantum Fourier transform, which includes the famous factoring and discrete 
logarithm algorithms of Peter Shor Il57l l58l . Since this topic is covered in several 
textbooks and recent surveys, we will only briefly survey this topic. We could have 
added several other sections on algorithms for generalizations of these problems, 
including several cases of the non-Abelian hidden subgroup problem, and hidden 
lattice problems over the reals (which have important applications in number the- 
ory), however these are covered in the recent survey |52 | and in substantial detail in 

ma. 

We continue in section[3]with a brief review of classic results on quantum search- 
ing and counting, and more generally amplitude amplification and amplitude esti- 
mation. 

In section]?] we discuss algorithms based on quantum walks. We will not cover 
the related topic of adiabatic algorithms, which was briefly summarized in [|52|; a 
broader survey of this and related techniques ("quantum annealing") can be found 
in 126i. 

We conclude with section ]5] on algorithms based on the evaluation of the trace 
of an operator, also referred to as the evaluation of a tensor network, and which has 
applications such as the approximation of the Tutte polynomial. 

The field of quantum algorithms has grown tremendously since the seminal work 
in the mid-1990s, and a full detailed survey would simply be infeasible for one ar- 
ticle. We could have added several other sections. One major omission is the de- 
velopment of algorithms for simulating quantum mechanical systems, which was 
Feynman's original motivation for proposing a quantum computer This field was 
briefly surveyed in |52|, with emphasis on the recent results in |131 (more recent 
developments can be found in ||63]| ). This remains an area worthy of a comprehen- 
sive survey; like the many other areas we have tried to survey, it is difficult because 
it is still an active area of research. It is also an especially important area because 
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these algorithms tend to offer a fully exponential speed-up over classical algorithms, 
and thus are likely to be among the first quantum algorithms to be implemented that 
will offer a speed-up over the fastest available classical computers. 

Lastly, one could also write a survey of quantum algorithms for intrinsically 
quantum information problems, like entanglement concentration, or quantum data 
compression. We do not cover this topic in this article, though there is a very brief 
survey in 152|. 

One can find a rather comprehensive list of the known quantum algorithms (up to 
mid 2008) in Stephen Jordan's PhD thesis f4T]. We hope this survey complements 
some of the other recent surveys in providing a reasonably detailed overview of the 
current state of the art in quantum algorithms. 



2 Algorithms based on the Quantum Fourier transform 

The early line of quantum algorithms was developed in the "black-box" or "ora- 
cle" framework. In this framework part of the input is a black-box that implements 
a function f{x), and the only way to extract information about / is to evaluate it 
on inputs x. These early algorithms used a special case of quantum Fourier trans- 
form, the Hadamard gate, in order solve the given problem with fewer black-box 
evaluations of / than a classical algorithm would require. 

Deutsch 1 27 1 formulated the problem of deciding whether a function / : {0, 1 } — > 
{0, 1} was constant or not. Suppose one has access to a black-box that implements / 
reversibly by mapping x,Qi—i' x,f{x); let us further assume that the black box in fact 
implements a unitary transformation Uf that maps \x) |0) t-^ \x) \ f{x)). Deutsch's 
problem is to output "constant" if /(O) = /(I) and to output "balanced" if /(O) 7^ 
/(I), given a black-box for evaluating /. In other words determine /(O) ®/(l) 
(where © denotes addition modulo 2). Outcome "0" means / is constant and "1" 
means / is not constant. 

A classical algorithm would need to evaluate / twice in order to solve this prob- 
lem. A quantum algorithm can apply Uf only once to create 

^|0) 1/(0)) + 1/(1)). 

Note that if /(O) = /(I), then applying the Hadamard gate to the first register 
yields |0) with probability 1, and if /(O) 7^ /(I), then applying the Hadamard gate 
to the first register and ignoring the second register leaves the first register in the 
state 1 1) with probability ^; thus a result of 1 1) could only occur if /(O) 7^ /(I)- 

As an aside, let us note that in general, given 
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applying the Hadamard gate to the first qubit and measuring it will yield "0" with 
probability ^ + Re{{\j/o\\j/i)) ; this "Hadamard test" is discussed in more detail in 
section |5] 

In Deutsch's case, measuring a "1" meant /(O) 7^/(1) with certainty, and a "0" 
was an inconclusive result. Even though it wasn't perfect, it still was something that 
couldn't be done with a classical algorithm. The algorithm can be made exact |24| 
(i.e. one that outputs the correct answer with probability 1) if one assumes further 
that Uf maps |x) \b) \x) \b(B f{x)), for b e {0, 1}, and one sets the second qubit 
to ^ |0)-^|1). Then f// maps 



(^)(^)-'-'-( '"'^'-'r"'"' )(^)' 



and measuring the first register yields the correct answer with certainty. 

The general idea behind the early quantum algorithms was to compute a black- 
box function / on a superposition of inputs, and then extract a global property of / 
by applying a quantum transformation to the input register before measuring it. We 
usually assume we have access to a black-box that implements 



or in some other form where the input value x is kept intact and the second register 
is shifted by /(x) is some reversible way. 

Deutsch and Jozsa f28l used this approach to get an exact algorithm that decides 
whether / : {0, 1}" h-> {0, 1} is constant or "balanced" (i.e. |/"^(0)| 1/^^1)1)' 
with a promise that one of these two cases holds. Their algorithm evaluated / 
only twice, while classically any exact algorithm would require 2"^' + 1 queries 
in the worst-case. Bernstein and Vazirani |12| defined a specific class of such 
functions /a : x i-> a • x, for any a € {0, 1}", and showed how the same algorithm 
that solves the Deutsch-Jozsa problem allows one to determine a with two evalua- 
tions of /a while a classical algorithm requires n evaluations. (Both of these algo- 
rithms can be done with one query if we have |x) \b) i— )■ |x) \b © /(x)) .) They further 
showed how a related "recursive Fourier sampling" problem could be solved super- 
polynomially faster on a quantum computer. Simon |59 1 later built on these tools to 
develop a black-box quantum algorithm that was exponentially faster than any clas- 
sical algorithm for finding a hidden string s G {0, 1}" that is encoded in a function 
/ : {0, 1 }" ^{0,1 }" with the property that /(x) = /(y) if and only if x y s. 

Shor ifSTl [38 1 built on these black-box results to find an efficient algorithm for 
finding the order of an element in the multiplicative group of integers modulo 
(which implies an efficient classical algorithm for factoring A^) and for solving the 



Thus a Hadamard gate on the first qubit yields the result 




f// : |x) |b) ^ |x) |b®/(x)) 
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discrete logarithm problem in the multiplicative group of integers modulo a large 
prime p. Since the most widely used public key cryptography schemes at the time 
relied on the difficulty of integer factorization, and others relied on the difficulty 
of the discrete logarithm problem, these results had very serious practical impli- 
cations. Shor's algorithms straightforwardly apply to black-box groups, and thus 
permit finding orders and discrete logarithms in any group that is reasonably pre- 
sented, including the additive group of points on elliptic curves, which is currently 
one of the most widely used public key cryptography schemes (see e.g. |48|). 

Researchers tried to understand the full implications and applications of Shor's 
technique, and a number of generalizations were soon formulated (e.g. |15, 36 1). 
One can phrase Simon's algorithm, Shor's algorithm, and the various generaliza- 
tions that soon followed as special cases of the hidden subgroup problem. Consider 
a finitely generated Abelian group G, and a hidden subgroup K that is defined by 
a function f : G ^ X (for some finite set X) with the property that f{x) = f{y) if 
and only if x — y E K (we use additive notation, without loss of generality). In other 
words, / is constant on cosets of K and distinct on different cosets of G. In the case 
of Simon's algorithm, G = Z'j and K — {0,s}. In the case of Shor's order-finding 
algorithm, G — Z and K — rZ where r is the unknown order of the element. Other 
examples and how they fit in the hidden subgroup paradigm are given in |51 1. 

Soon after, Kitaev |46| solved a problem he called the Abelian stabilizer prob- 
lem using an approach that seemed different from Shor's algorithm, one based in 
eigenvalue estimation. Eigenvalue estimation is in fact an algorithm of independent 
interest for the purpose of studying quantum mechanical systems. The Abelian sta- 
bilizer problem is also a special case of the hidden subgroup problem. Kitaev's idea 
was to turn the problem into one of estimating eigenvalues of unitary operators. In 
the language of the hidden subgroup problem, the unitary operators were shift op- 
erators of the form f{x) i—>-f{x+y). By encoding the eigenvalues as relative phase 
shifts, he turned the problem into a phase estimation problem. 

The Simon/Shor approach for solving the hidden subgroup problem is to first 
compute Y.X In the case of finite groups G, one can sum over all the el- 

ements of G, otherwise one can sum over a sufficiently large subset of G. For ex- 
ample, if G = Z, and /(x) = mod for some large integer A^, we first compute 
^x=o' |x) \a^), where 2" > A^^ (we omit the " mod A^" for simplicity). If r is the order 
of a (i.e. r is the smallest positive integer such that a'' = 1) then every value x of the 
form X —y + kr gets mapped to a-''. Thus we can rewrite the above state as 

'l WK> = L(Lb+»)K) (1) 

jc=0 .y=0 j 

where each value of in this range is distinct. Tracing out the second register we 
thus are left with a state of the form 



j 
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for a random y and where j goes from to [(2" — 1 ) /r\ . We loosely refer to this state 
as a "periodic" state with period r. We can use the inverse of the quantum Fourier 
transform (or the quantum Fourier transform) to map this state to a state of the form 
Y.x(^x \x), where the amplitudes are biased towards values of x such that x/2" w k/r. 
With probability at least ^ we obtain an x such that \x/2" —k/r\ < 1/2"+'. One 
can then use the continued fractions algorithm to find k/r (in lowest terms) and 
thus find r with high probability. It is important to note that the continued fractions 
algorithm is not needed for many of the other cases of the Abelian hidden subgroup 
considered, such as Simon's algorithm or the discrete logarithm algorithm when the 
order of the group is akeady known. 

In contrast, Kitaev's approach for this special case was to consider the map f/„ : 
\b) I— |/7fl'^). It has eigenvalues of the form e^^'^^'', and the state |1) satisfies 

where 1 1//^) is an eigenvector with eigenvalue g^TTf^/r. 

If we consider the controlled-t/„, denoted c — Ua, which maps |0) \b) ^ |0) \b) and 
|1) 1^) i-> |1) \ba), and if we apply it to the state (|0) + we get 

(|0) + |l))|v/.>^(|0>+.2-V'-|i))|^,). 

In other words, the eigenvalue becomes a relative phase, and thus we can reduce 
eigenvalue estimation to phase estimation. Furthermore, since we can efficiently 
compute fl^' by performing j multiplications modulo A^, one can also efficiently 
implement c — U^^i ™d thus easily obtain the qubit |0) +e2m2^(A7/-) ^^j. jjjfgggj- 
values of j without performing c — a total of 2-' times. Kitaev developed an effi- 
cient ad hoc phase estimation scheme in order to estimate k/r to high precision, and 
this phase estimation scheme could be optimized further ll24l . In particular, one can 
create the state 

(io)+ii))"ii>= l'ix)|i> = i:'e wiv^*) 

K=Q k=0 x=0 

(we use the standard binary encoding of the integers x E {0, 1, ... ,2" — 1} as bits 
strings of length n) the apply the c — t/ 2/ using the (n — j)th bit as the control bit, 
and using the second register (initiaUzed in |1) =Y,k IWk)) as the target register, for 
7 = 0, 1,2, ... ,n — 1, to create 

I,(|0) +^2-2-4 . . . (|0) +^2-2^ |1))(|0) +^.2.,f 1^^^ (2) 

= k) = n-JoLt'o' e^""'^' |x) I V/,) . (3) 
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If we ignore or discard the second register, we are left with a state of the form 
Y.x^^ g2;rai:/r j^j.^ f^j. ^ random value of ^ G {0, 1, . . . , r — 1}. The inverse quantum 
Fourier transformation maps this state to a state ay \y) where most of weight 
of the amplitudes is near values of y such that y/2" sa k/r for some integer k. 
More specifically \y/2" ~k/r\ < 1 /2"+' with probability at least 4/;r^; furthermore 
\y/2" —k/r\ < 1/2" with probability at least S/n^. As in the case of Shor's algo- 
rithm, one can use the continued fractions algorithm to determine k/r (in lowest 
terms) and thus determine r with high probability. 

It was noted f24] that this modified eigenvalue estimation algorithm for order- 
finding was essentially equivalent to Shor's period-finding algorithm for order- 
finding. This can be seen by noting that we have the same state in Equation [T] and 
Equation |2j and in both cases we discard the second register and apply an inverse 
Fourier transform to the first register The only difference is the basis in which the 
second register is mathematically analyzed. 

The most obvious direction in which to try to generalized the Abelian hid- 
den subgroup algorithm is to solve instances of the hidden subgroup problem for 
non-Abelian groups. This includes, for example, the graph automorphism problem 
(which corresponds to finding a hidden subgroup of the symmetric group). There 
has been non-trivial, but limited, progress in this direction, using a variety of al- 
gorithmic tools, such as sieving, "pretty good measurements" and other group the- 
oretic approaches. Other generalizations include the hidden shift problem and its 
generalizations, hidden lattice problems on real lattices (which has important appli- 
cations in computational number theory and computationally secure cryptography), 
and hidden non-linear structures. These topics and techniques would take several 
dozen pages just to summarize, so we refer the reader to ll52l or ll22l . and leave 
more room to summarize other important topics. 



3 Amplitude Amplification and Estimation 

A very general problem for which quantum algorithms offer an improvement is that 
of searching for a solution to a computational problem in the case that a solution x 
can be easily verified. We can phrase such a general problem as finding a solution 
X to the equation f{x) = 1, given a means for evaluating the function /. We can 
further assume that / : {0, 1}" {0, 1}. 

The problems from the previous section can be rephrased in this form (or a small 
number of instances of problems of this form), and the quantum algorithms for these 
problems exploit some non-trivial algebraic structure in the function / in order to 
solve the problems superpolynomially faster than the best possible or best known 
classical algorithms. Quantum computers also allow a more modest speed-up (up to 
quadratic) for searching for a solution to a function / without any particular struc- 
ture. This includes, for example, searching for solutions to A^P-complete problems. 

Note that classically, given a means for guessing a solution x with probability p, 
one could amplify this success probability by repeating many times, and after a num- 
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ber of guess in (9(1 /p), the probability of finding a solution is in i2(l). Note that the 
quantum implementation of an algorithm that produces a solution with probability 
p will produce a solution with probability amplitude y/p. The idea behind quan- 
tum searching is to somehow amplify this probability to be close to 1 using only 
0(1/ ^/p) guesses and other steps. 

Lov Grover |37| found precisely such an algorithm, and this algorithm was 
analyzed in detail and generalized |16, 11. 38 18 1 to what is known as "am- 
plitude amplification." Any procedure for guessing a solution with probability p 
can be (with modest overhead) turned into a unitary operator A that maps |0) i-> 
y/p + \/l ^ P I Vb) , where = 00 ... 0, | i/i ) is a superposition of states encod- 
ing solutions X to /(x) = 1 (the states could in general encode x followed by other 
"junk" information) and is a superposition of states encoding values of x that 
are not solutions. 

One can then define the quantum search iterate (or "Grover iterate") to be 

Q = -AUQA-^Uf 

where Uf : |x) (-l)-^W |x>, and Uq = / -2|0) (0| (in other words, maps |0) 
— |0) and |x) i-^ |x) for any x ^ 0). Here we are for simplicity assuming there are 
no "junk" bits in the unitary computation of / by A. Any such junk information can 
either be "uncomputed" and reset to all Os, or even ignored (letting U f act only on 
the bits encoding x and applying the identity to the junk bits). 

This algorithm is analyzed thoroughly in the literature and in textbooks, so we 
only summarize the main results and ideas that might help understand the later sec- 
tions on searching via quantum walk. 

If one applies Q a total of k times to the input state A |00. . .0) = sin(0) + 
cos(0) |)/o), where y//? = sin(0), then one obtains 

e'^A 1 00 . . . 0) = sin ( (2A: + 1 ) ) I v/i > + cos ( (2A: + 1 ) ) I v/o > • 

This implies that with k w one obtains | i/i ) with probabihty amplitude close 
to 1, and thus measuring the register will yield a solution to f{x) = 1 with high 
probability. This is quadratically better than what could be achieved by preparing 
and measuring A |0) until a solution is found. 

One application of such a generic amplitude amplification method is for search- 
ing. One can also apply this technique to approximately count [19.1 the number of 
solutions to f{x) = 1, and more generally to estimate the amplitude ifTSl with which 
a general operator A produces a solution to f{x) = 1 (in other words, the transition 
amplitude from one recognizable subspace to another). 

There are a variety of other applications of amplitude amplification that cleverly 
incorporate amplitude amplification in a non-obvious way into a larger algorithm. 
Some of these examples are discussed in 1.52] and most are summarized at [41 1. 

Since there are some connections to some of the more recent tools developed in 
quantum algorithms, we will briefly explain how amplitude estimation works. 



Algorithms for Quantum Computers 



9 



Consider any unitary operator A that maps some known input state, say |0) = 
|00...0), to a superposition sin(0) |v/i) +cos(0) |v/o), < < 7z/2, where |v/i) 
is a normaUzed superposition of "good" states x satisfying /(x) = 1 and |\//o) is a 
normalized superposition of "bad" states x satisfying /(x) = (again, for simpHc- 
ity we ignore extra junk bits). If we measure A |0), we would measure a "good" x 
with probability sin^(0). The goal of amplitude estimation is to approximate the 
amplitude sin(0). Let us assume for convenience that there are f > good states 
and n - r > bad states, and that < sin(0) < n/2. 

We note that the quantum search iterate Q = —AUoA^ ' Uf has eigenvalues 1 1//+) = 
:j^{\Wo) +i\Wi)) ™d \ W-) = :j^i\Wo) ^i\Wi)) with respective eigenvalues e'^^ and 
e^'^^ . It also has 2" — 2 other eigenvectors; f — 1 of them have eigenvalue +1 and 
are the states orthogonal to \y/i) that have support on the states |x) where /(x) = 1, 
and n — t—1 of them have eigenvalue — 1 and are the states orthogonal to | i/o) that 
have support on the states |x) where /(x) = 0. It is important to note that A\y/) = 
^ \ Y+) + ^-if \W-) has its full support on the two dimensional subspace spanned 
by and |v/-). 

It is worth noting that the quantum search iterate Q — —AUoA^^Uf can also be 
thought of as two reflections 

-f4,0)f// = (2A|0)(0|At-/)(/-2 Y. 

''|/(x) = l 

one which flags the "good" subspace with a —1 phase shift, and then one that flags 
the subspace orthogonal to A |0) with a —1 phase shift. In the two dimensional sub- 
space spanned by A |0) and UfA\Q), these two reflections correspond to a rotation by 
angle 29. Thus it should not be surprising to find eigenvalues e^^Tr/e ^^j. g^^jgg jjj (jjjg 
two dimensional subspace. (In the section on quantum walks, we'll consider a more 
general situation where we have an operator Q with many non-trivial eigenspaces 
and eigenvalues.) 

Thus one can approximate sin(0) by estimating the eigenvalue of either \yf+) 
or I V'^- ) ■ Performing the standard eigenvalue estimation algorithm on Q with input 
A |0) (as illustrated in Figure [T]) gives a quadratic speed-up for estimating sin^(0) 
versus simply repeatedly measuring A |0) and counting the frequency of Is. In par- 
ticular, we can obtain an estimate of such that |0 — 0| < £ with high (constant) 
probability using (9(l/e) repetitions of Q, and thus (9(1 /e) repetitions of Uf, A and 
A^'. For fixed p, this implies that p = sin^(0) satisfies \p — p\ € 0(e) with high 
probability. Classically sampling requires 0(1 /e^) samples for the same precision. 
One application is speeding up the efficiency of the "Hadamard" test mentioned in 
section l5.2.3l 

Another interesting observation is that increasingly precise eigenvalue estimation 
of Q on input A |0) leaves the eigenvector register in a state that gets closer and closer 
to the mixture j |v/+) {yz+l + \ \\lf-) (V-l which equals \ \\if\) {\lf\\ + \ \Wo) {Wq\- 
Thus eigenvalue estimation will leave the eigenvector register in a state that contains 
a solution to /(x) = 1 with probability approaching ^ . One can in fact using this 
entire algorithm as subroutine in another quantum search algorithm, and obtain an 
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Fig. 1 The above circuit estimates the amplitude with which the unitary operator A maps |0) to 
the subspace of solutions to /(x) = 1. One uses m 6 I2(log(l/e)) qubits in the top register, and 
prepares it in a uniform superposition of the strings y representing the integers 0, 1,2, . . . ,2™ — 1 
(one can in fact optimize the amplitudes of each y to achieve a slightly better estimate |33 1). The 
controlled-2' circuit applies to the bottom register when the value y is encoded in the top qubits. 
If A|0) = sin(9) +cos(6) Ivo). where is a superposition of solutions x to /(x) = 1, and 
I y/b) is a superposition of values x with /(x) = 0, then the value y = yiy2 ■ ■ - ym measured in the top 
register corresponds to the phase estimate 2Ky/2"' which is likely to be within |§ (modulo 2n) of 
either 20 or -20. Thus the value of sin^ ny/2'" is likely to satisfy | sin^ ny/2"' - sir? B\ € 0(e). 



algorithm with success probabiHty approaching 1 fSOl |43]| . and the convergence 
rate can be improved further [61J. Another important observation is that for the 
purpose of searching, the eigenvalue estimate register is never used except in order 
to determine a random number of times in which to apply Q to the second register. 
This in fact gives the quantum searching algorithm of 1 16 1. In other words, applying 
Q a random number of times decoheres or approximately projects the eigenvector 
register in the eigenbasis, which gives a solution with high probability. The method 
of approximating projections by using randomized evolutions was recently refined, 
generalized, and applied in 1741 . 

Quantum searching as discussed in this section has been further generalized in 
the quantum walk paradigm, which is the topic of the next section. 
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U <] 

Fig. 2 The amplitude estimation circuit can also be used for searching for a solution to /(x) = 1, 
as illustrated in the figure on the left. The second register starts off in the state A|0), and after 
a sufficiently precise eigenvalue estimation of the quantum search iterate Q, the second regis- 
ter is approximately projected in the eigenbasis. The idea projection would yield the mixed state 
i |V+> (V+I + 5 IV"-> (V-l = 5 IVi) + 5 IVb) (Vo|, and thus yields a solution to /(x) = 1 with 
probabiUty approaching ^ as m gets large (the probability is in ^2(1) once m e 12(1/0?)). The 
same algorithm can in fact be achieved by discarding the top register and instead randomly picking 
a value y e {0, l,...,!"* — 1} and applying toA|0), as illustrated on the right. 

4 Quantum Walks 

Random walks are a common tool throughout classical computer science. Their 
applications include the simulation of biological, physical and social systems, as 
well as probabilistic algorithms such as Monte Carlo methods and the Metropolis 
algorithm. A classical random walk is described by a « x n matrix P, where the entry 
Pu,v is the probability of a transition from a vertex v to an adjacent vertex m in a graph 
G. In order to preserve normalization, we require that P is stochastic — that is, the 
entries in each column must sum to 1 . We denote the initial probability distribution 
on the vertices of G by the column vector q. After n steps of the random walk, the 
distribution is given by P''q. 

Quantum walks were developed in analogy to classical random walks, but it was 
not initially obvious how to do this. Most generally, a quantum walk could begin in 
an initial state po = | V/b)(Vb| and evolve according to any completely positive map 

5 such that, after ? time steps, the system is in state p{t) = S"{po). Such a quantum 
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walk is simultaneously using classical randomness and quantum superposition. We 
can focus on the power of quantum mechanics by restricting to unitary walk opera- 
tions, which maintain the system in a coherent quantum state. So, the state at time 
t can be described by \'^f{t)) — U' \'^o), for some unitary operator U. However, it 
is not initially obvious how to define such a unitary operation. A natural idea is to 
define the state space A with basis {|v) : v € V{G)} and walk operator P defined by 
Pu.v — yJPii.v However, this will not generally yield a unitary operator P, and a more 
complex approach is required. Some of the earliest formulations of unitary quantum 
walks appear in papers by Nayak and Vishwanath |53|, Ambainis et al. [81, Kempe 
|[44|, and Aharonov et al. |2|. These early works focused mainly on quantum walks 
on the line or a cycle. In order to allow unitary evolution, the state space consisted 
of the vertex set of the graph, along with an extra "coin register" The state of the 
coin register is a superposition of |Left) and | Right). The walk then proceeds by 
alternately taking a step in the direction dictated by the coin register and applying 
a unitary "coin tossing operator" to the coin register. The coin tossing operator is 
often chosen to be the Hadamard gate. It was shown in l2l|6l|8]|44l[53] that the mix- 
ing and propagation behaviour of these quantum walks was significantly different 
from their classical counterparts. These early constructions developed into the more 
general concept of a discrete time quantum walk, which will be defined in detail. 

We will describe two methods for defining a unitary walk operator. In a discrete 
time quantum walk, the state space has basis vectors {\u) ® \v) : m, v e V}. Roughly 
speaking, the walk operator alternately takes steps in the first and second registers. 
This is often described as a walk on the edges of the graph. In a continuous time 
quantum walk, we will restrict our attention to symmetric transition matrices P. 
We take P to be the Hamiltonian for our system. Applying Schrodinger's equation, 
this will define continuous time unitary evolution in the state space spanned by 
{\u) : M G y}. Interestingly, these two types of walk are not known to be equivalent. 
We will give an overview of both types of walk as well as some of the algorithms 
that apply them. 



4.1 Discrete Time Quantum Walks 

Let f be a stochastic matrix describing a classical random walk on a graph G. We 
would like the quantum walk to respect the structure of the graph G, and take into 
account the transition probabilities Pj, ,,. The quantum walk should be governed by 
a unitary operation, and is therefore reversible. However, a classical random walk is 
not, in general, a reversible process. Therefore, the quantum walk will necessarily 
behave differently than the classical walk. While the state space of the classical walk 
is V, the state quantum walk takes place in the space spanned by v) : u,v G V}. 
We can think of the first register as the current location of the walk, and the second 
register as a record of the previous location. To facilitate the quantum walk from a 
state |m, v), we first mix the second register over the neighbours of u, and then swap 
the two registers. The method by which we mix over the neighbours of u must be 



Algorithms for Quantum Computers 



13 



chosen carefully to ensure that it is unitary. To describe this formally, we define the 
following states for each m e V : 

|V/„) = |m)(»1^ /f^lv) (4) 

vev 

w:)^Y.^u\v)®\u). (5) 

vGV 

Furthermore, define the projections onto the space spanned by these states: 

n=Y^Wu){wu\ (6) 

(7) 

In order to mix the second register, we perform the reflection (277 — /) . Letting S 
denote the swap operation, this process can be written as 

S(2n-I). (8) 

It turns out that we will get a more elegant expression for a single step of the quan- 
tum walk if we define the walk operator W to be two iterations of this process: 

1^=5(217-7)5(217-/) (9) 

={2{sns)-i)(2n-i) (10) 
=(2n*-/)(27r-/). (11) 

So, the walk operator W is equivalent to performing two reflections. 

Many of the useful properties of quantum walks can be understood in terms of 
the spectrum of the operator W. First, we define D, the n x n matrix with entries 
Du,v = \/PuA-Pv,u- This is called the discriminant matrix, and has eigenvalues in the 
interval [0,1]. In the theorem that follows, the eigenvalues of D that lie in the interval 
(0, 1) will be expressed as cos(0i), ...,cos(0i). Let , 10^) be the correspond- 
ing eigenvectors of D. Now, define the subspaces 

A=span{\-^u)} (12) 
B ^ span{\\if:)} . (13) 



Finally, define the operator 



and 



2=Ilrv)(v| (14) 



vev 



\i>j)-Q\Qj)- (15) 
We can now state the following spectral theorem for quantum walks: 

Theorem 1 (Szegedy, II60II ). The eigenvalues ofW acting on the space A+B can be 
described as follows: 
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1. The eigenvalues ofW with non-zero imaginary part are e*^'^',e*^'^2 _ ^g±2'% 
where cos(0i), ...,cos(0|(.) are the eigenvalues of D in the interval (0,1). The 
corresponding (un-normalized) eigenvectors ofW{P) can be written as \<pj^ — 
e^^'^jS\^j)forj^l,...,k. 

2. ARB andA^DB^ span the +1 eigenspace ofW. There is a direct correspon- 
dence between this space and the +1 eigenspace of D. In particular, the +1 
eigenspace ofW has the same degeneracy as the +1 eigenspace ofD. 

3. AOB-^ and OB span the —1 eigenspace ofW. 

We say that P is symmetric \f = P and ergodic if it is aperiodic. Note that if P is 
symmetric, then the eigenvalues of D are just the absolute values of the eigenvalues 
of P. It is well-known that if P is ergodic, then it has exactly one stationary distri- 
bution (i.e. a unique +1 eigenvalue). Combining this fact with theorem ([T} gives us 
the following corollary: 

Corollary 1. If P is ergodic and symmetric, then the corresponding walk operator 
W has unique +1 eigenvector in Span{A,B): 

w) = ^Y^w.)^^Y^w:) (16) 

Moreover, if we measure the first register of we get a state corresponding to 
vertex u with probability 

PK«) = -E^- = -. (17) 

This is the uniform distribution, which is the unique stationary distribution for the 
classical random walk. 



4.1.1 The Phase Gap and the Detection Problem 

In this section, we will give an example of a quadratic speedup for the problem of 
detecting whether there are any "marked" vertices in the graph G. First, we define 
the following: 

Definition 1. The phase gap of a quantum walk is defined as the smallest postive 
value 29 such that e^^'^ are eigenvalues of the quantum walk operator It is denoted 
hy A{P). 

Definition 2. Let M C y be a set of marked vertices. In the detection problem, we 
are asked to decide whether M is empty. 

In this problem, we assume that P is symmetric and ergodic. We define the following 
modified walk P': 

Kv=\^ u^v,veM (18) 

[ 1 M = V, V e M. 
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This walk resembles P, except that it acts as the identity on the set M. That is, if the 
walk reaches a marked vertex, it stays there. Let Pm denote the operator P' restricted 
to y \ M. Then, arranging the rows and columns of P' , we can write 

^'-ft?V (19) 



By Theoremjl] if M = 0, then Pm = P' ^ P and \\Pm\\ = 1- Otherwise, we have the 
strict inequality ||-Pm|| < 1. The following theorem bounds ||-Pm|| away from 1: 



Theorem 2. If (I — 5) is the absolute value of the eigenvalue of P with second 

2 



largest magnitude, and \M\ > £ \V\, then \\Pm\\ < 1 ^ 



We will now show that the detection problem can be solved using eigenvalue esti- 
mation. Theorem |2] will allow us to bound the running time of this method. First, 
we describe the discriminant matrix for P': 

u,v 

D{P')uv^{l u^v.veM (20) 
otherwise. 

Now, beginning with the state 

IV') = 4 E IV^v) = 4^ E 1^) ' (21) 

V" vev V " vev 

we measure whether or not we have a marked vertex; if so, we are done. Otherwise, 
we have the state 

\Vm)= J—. L v^|«)|v>. (22) 

VW\M\u.vt^\M 



If M = 0, then this is the state \\(f) defined in (16 1, and is the +1 eigenvector of 
W{P). Otherwise, by Theoremjl] this state lies entirely in the space spanned by 
eigenvectors with values of the form e'^^'^J, where Oj is an eigenvalue of Pm- Ap- 
plying Theorem [2] we know that 

1, 5e, Ids 
0>cos-i(l-— )>-^ — . (23) 

So, the task of distinguishing between M being empty or non-empty is equivalent 
to that of distinguishing between a phase parameter of and a phase parameter of 

at least Therefore, applying phase estimation to W{P') on state with 

precision (9(%/5e) will decide whether M is empty with constant probability. This 



requires time (9(^=) 
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By considering the modified walk operator P' , it can be shown that the detection 
problem requires 0{^) time in the classical setting. Therefore the quantum algo- 
rithm provides a quadratic speedup over the classical one for the detection problem. 



4.1.2 Quantum Hitting Time 

Classically, the first hitting time is denoted H{p,M). For a walk defined by P, start- 
ing from the probability distribution p on V, H{p,M) is the smallest value n such 
that the walk reaches a marked vertex v G M at some time f e {0, . . . , n} with constant 
probability. This idea is captured by applying the modified operator P' and some n 
times, and then considering the probability that the walk is in some marked state 
V G M. Let Pm be any initial distribution restricted to the vertices V\M. Then, at 
time t, the probability that the walk is in an unmarked state is ||Pj^pM|| j, where || • || ^ 
denotes the Li norm. Assuming that M is non-empty, we can see that \\Pm\\ < l-So, 
as f — oo, we have H^mPm ||i 0. So, as f oo, the walk defined by P' is in a 
marked state with probability 1 . As a result, if we begin in the uniform distribution 
n on V, and run the walk for some time t, we will "skew" the distribution towards 
M, and thus away from the unmarked vertices. So, we define the classical hitting 
time to be the minimum t such that 

||F>M||i<e (24) 

for any constant £ of our choosing. Since the quantum walk is governed by a unitary 
operator, it doesn't converge to a particular distribution the way that the classical 
walk does. We cannot simply wait an adequate number of time steps and then mea- 
sure the state of the walk; the walk might have already been in a state with high 
overlap on the marked vertices and then evolved away from this state! Quantum 
searching has the same problem when the number of solutions is unknown. We can 
get around this in a similar way by considering an expected value over a sufficiently 
long period of time. This will form the basis of our definition of hitting time. Define 

v\y\M\ ,ev\M 

Then, if M is empty, is a +1 eigenvector of W, and W' \n) = \n) for all f. 
However, if M is non-empty, then the spectral theorem tells us that | n) lies in the 
space spanned by eigenvectors with eigenvalues e*^'^' for non-zero 9j. As a result, 
it can be shown that, for some values of t, the state W' \ 7z) is "far" from the initial 
distribution |;r). We define the quantum hitting in the same way as Szegedy ll60ll . 
The hitting time Hq{W) as the minimum value T such that 
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This leads us to Szegedy's hitting time theorem II6OII : 
Theorem 3. Tlw quantum hitting time Hq{W) is 




Corollary 2. Applying Theorem |2| if the second largest eigenvalue of P has mag- 
nitude (1 - 5) and \M\ / \V\ > e, then Hq{W) € 0{^) 

Notice that this corresponds to the running time for the algorithm for the detec- 
tion problem, as described above. Similarly, the classical hitting time is in 0{j^), 
corresponding to the best classical algorithm for the detection problem. 

It is also worth noting that, if there no marked elements, then the system remains 
in the state |;r), and the algorithm never "hits." This gives us an alternative way 
to approach the detection problem. We run the algorithm for a randomly selected 
number of steps f e {0, ...,7"} with T of size 0{^/Yj5£), and then measure whether 
the system is still in the state |;r); if there are any marked elements, then we can 
expect to find some other state with constant probability. 



4,1.3 The Element Distinctness Problem 

In the element distinctness problem, we are given a black box that computes the 
function 

f : {!,... ,n}^S (27) 

and we are asked to determine whether there exist x,y E {!,...,«} with x ^ y and 
f{x) — f{y). We would like to minimize the number of queries made to the black 
box. There is a lower bound of i2(n^/^) on the number of queries, due indirectly 
to Aaronson and Shi |1 1. The algorithm of Ambainis (1] proves that this bound is 
tight. The algorithm uses a quantum walk on the Johnson graph J{n,p) has vertex 
set consisting of all subsets of {!,...,«} of size p. Let and ^2 be /^-subsets of 
{1, ...,n}. Then, is adjacent to S2 if and only if \Si r\S2\ = p — I- The Johnson 
graph J{n,p) therefore has vertices, each with degree p{n — p). 

The state corresponding to a vertex S of the Johnson graph will not only record 
which subset {s\ , ■■■,Sp} C {!,...,«} it represents, but the function values of those 
elements. That is, 

\S) = \si,S2...,Sp;fisi),fis2),...Jisp)). (28) 

Setting up such a state requires p queries to the black box. 

The walk then proceeds for t iteration, where f is chosen from the uniform dis- 
tribution on {0 , . . . , r } and T e 0(1/ VSe ) . Each iteration has two parts to it. First, 
we need to check if there are distinct Si,Sj with f{si) = f{sj) — that is, whether the 
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vertex S is marked. This requires no calls to the black box, since the function values 
are stored in the state itself. Second, if the state is unmarked, we need to take a step 
of the walk. This involves replacing, say, s, with .sj, requiring one query to erase 
/(i,) and another to insert the value f{s\). So, each iteration requires a total of 2 
queries. 

We will now bound £ and 5. If only one pair x.y exists with f{x) — f{y), then 
there are ("jZ^ marked vertices. This tells us that, if there are any such pairs at all, 
epsilon is Q {p^ /n^)- Johnson graphs are very well-understood in graph theory. It is 
a well known result that the eigenvalues for the associated walk operator are given 
by: 

A, = l-^^^^±^ (29) 
p{n-p) 

for < /' < p. For a proof, see ll20l . This give us 5 = p(n-p) ■ Putting this together, 
we find that ^\/5e is 0{^). So, the number of queries required is 0{p + ^). 
In order to minimize this quantity, we choose m to be of size 

0(n2/3). The query 

complexity of this algorithm is (9(n^/^), matching the lower bound of Aaronson and 

Shi m. 



4.1.4 Unstructured Search as a Discrete Time Quantum Walk 

We will now consider unstructured search in terms of quantum walks. For unstruc- 
tured search, we are required to identify a marked element from a set of size n. Let 
M denote the set of marked elements, and U denote the set of unmarked elements. 
Furthermore, let m — \M\ and q = \U\. We assume that the number of marked el- 
ements, m, is very small in relation to the total number of elements n. If this were 
not the case, we could efficiently find a marked element with high probability by 
simply checking a randomly chosen set of vertices. Since the set lacks any structure 
or ordering, the corresponding walk takes place on the complete graph on n vertices. 
Let us define the following three states: 

\UU) = - ^ |M,y) (30) 
? u.veu 

|f/M) = ^ ^ |m,v) (31) 



veM 

|Mt/> = -U E (32) 

veu 

Noting that {\UU) , \ UM) , \MU)} is an orthonormal set, we will consider the action 
of the walk operator on the three dimensional space 



r = span{\UU) , \UM) , \MU)}. 



(33) 
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In order to do this, we will express the spaces A and B, defined in ( [T2p3| ) in terms 
of a different basis. First we label the unmarked vertices: 



f/ = {mo,---,m^-i}- (34) 



Then, we define 



and 



1 Imjk 



1 t^' litijk 
V? ;=0 



V^:, (36) 



with k ranging from to ^ — 1. Note that |7o) corresponds to the definition of |;r) in 
( |25] l. We can then rewrite A and B: 

A^span{\yk)} (37) 

B = spanM)}. (38) 

Now, note that for k^O, the space F is orthogonal to \ jk) and | ) . Furthermore, 

\ya)^^{V^\UM)+^\UU)) (39) 

and 

m) = ^{V^i\MU) + ^\UU)). (40) 
Therefore, the walk operator 

W = (277*-/) (217-7) (41) 
when restricted to F is simply 

W^'=(2|^>(^|-/)(2|7o>(7o|-/). (42) 



Figure |4/L4 illustrates the space F which contains the vectors jyo) and [^fj). 

At this point, it is interesting to compare this algorithm to Grover's search algo- 
rithm. Define 

\p,)=W'\UU). (43) 

Now, define \p}) and \pf) to be the projection of |p,) onto span{\UU) ,\UM)} 
and span{\UU) , \MU)} respectively. We can think of these as the "shadow" cast by 
the vector |p,) on two-dimensional planes within the space F. Note that jyo) Ues in 
the span{\UU) , \UM)} and its projection onto span{\UU) , \MU)} is yfqJn\UU). 
So, the walk operator acts on |p/) by reflecting it around the vector |7o) and then 
around ^Jqjn \UU). This is very similar to Grover search, except for the fact that 
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\UM) 



Fig. 3 Tlie 3 dimensional space F. The walk operator acts on this space by alternately performing 
reflections in the vector \yf)) and j}^). 



the walk operator in this case does not preserve the magnitude of |pj). So, with each 
application of the walk operator, \p} ) is rotated by 20, where Q = XaxT^ {^J m / q) is 
the angle between \UU) and |7o). The case for \pf) is exactly analogous, with the 
rotation taking place in the plane span{ \ UU) , \MU) }. So, we can think of the quan- 
tum walk as Grover search taking place simultaneously in two intersecting planes. It 
should not come as a surprise, then, that we can achieve a quadratic speedup similar 
to that of Grover search. 

We will now use a slightly modified definition of hitting time to show that the 
walk can indeed be used to find a marked vertex in 0{^/n) time. Rather than using 



the hitting time as defined in section 4.1.2 we will replace the state \tz) = \ yo) with 



\UU). Note that we can create \UU) from \n) by simply measuring whether the 
second register contains a marked vertex. This is only a small adjustment, since |7o) 
is close to \UU). Furthermore, both lie in the space F, and the action of the waUc 
operator is essentially identical for both starting states. It should not be surprising to 



the reader that the results of section 4. 1 .2 apply to this modified definition as well. 



In this case, the operator P is defined by: 



if M = V 
Puv = < , (44) 
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Let VQ, v„_i be a labeling of the vertices of G. Let x", ^ denote the eigen- 

vectors of P, with xj^ denoting the amplitude on Vj in x'^. Then, the eigenvalues of P 



are as follows: 



xl^--e^. (45) 



1 lltijk 



Then, x'' has eigenvalue 1 , and is the stationary distribution. All the other have 
eigenvalue — l/(n — 1), giving a spectral gap of 5 = 'j^. Applying corollary 2 this 
gives us quantum hitting time for the corresponding quantum walk operator W 



Hq{W)<^^-"^^^0{V^). (46) 

So, we run the walk for some randomly selected time fG{0,...,r— 1} with T of size 
0{y/n), then measure whether either the first or second register contains a marked 
vertex. Applying theorem[3] the probability that neither contains a marked vertex is 

l^^i , 1^-1 



-j:\{p-m)H-j:\{pr\uu)\ 

^ t=0 ^ t=Q 

= ^-^L\\\Pr)-\UU) 



1 W 2 



<1 



r=0 

\M\ 



2V 



Assuming that \M\ is small compared to \V\, we find a marked vertex in either the 
first or second register with high probability. Repeating this procedure a constant 
number of times, our success probability can be forced arbitrarily close to 1. 



4,1.5 The MNRS Algorithm 

Magniez, Nayak, Roland and Santha [47] developed an algorithm that generalizes 
the search algorithm described above to any graph. A brief overview of this al- 
gorithm and others is also given in the survey paper by Santha ll56l . The MNRS 
algorithm employs similar principles to Grover's algorithm; we apply a reflection in 
M, the space of unmarked states, followed by a reflection in |;r), a superposition of 
marked and unmarked states. This facilitates a rotation through an angle related to 
the number of marked states. In the general case considered by Magniez et al., |7r) 
is the stationary distribution of the walk operator It turns out to be quite difficult to 
implement a reflection in |;r) exactly. Rather, the MNRS algorithm employs an ap- 
proximate version of this reflection. This algorithm requires O ( applications 

of the walk operator, where 5 is the eigenvalue gap of the operator P, and e is the 
proportion vertices that are marked. In his survey paper, Santha |56| outlines some 
applications of the MNRS algorithm, including a version of the element distinctness 
problem where we are asked to find elements x and y such that /(x) — f{y). 
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4.2 Continuous Time Quantum Walks 

To define a classical continuous time random walk for a graph with no loops, we 
define a matrix similar to the adjacency matrix of G, called the Laplacian: 

{0 u ^ v,uv ^ E 

1 M^v,MvG£ (47) 

—deg{v) u = v. 

Then, given a probability distribution p{t) on the vertices of G, the walk is defined 
by 

j^p{t)=Lp{t). (48) 

Using the Laplacian rather than the adjacency matrix ensures that p{t) remains nor- 
maUzed. A continuous time quantum walk is defined in a similar way. For simplicity, 
we will assume that the Laplacian is symmetric, although it is still possible to define 
the walk in the asymmetric case. Then, since the Laplacian is Hermitian, we can 
simply take it to be the Hamiltonian of our system. Letting \p{t)) be a normalized 
vector in C^, Schrodinger's equation gives us 

i^^\p{t))=L\p{t)). (49) 

Solving this equation, we get an explicit expression for |p(f)): 

\p{t))=e-'^\pm. (50) 

Let {|Ai) ., , , |A„)} be the eigenvectors of L with corresponding values {Ai, A„}. 
We can rewrite the expression for |p (?)) in terms of this basis as follows: 

|p(/)) = t^^-'V(A,-|po)|A,). (51) 

7=1 

Clearly, the behaviour of a continuous time quantum walk is very closely related to 
the eigenvectors and spectrum of the Laplacian. 

Note that we are not required to take the Laplacian as the Hamiltonian for the 
walk. We could take any Hermitian matrix we like, including the adjacency matrix, 
or the transition matrix of a (symmetric) Markov chain. 

4.2.1 A Continuous Time Walk for Unstructured Search 

For unstructured search, the walk takes place on a complete graph. First, we define 
the following three states: 
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1^) = ^ L (52) 



l^) = ^LK')- (53) 

V 1^1 

The Hamiltonian we will use is a slightly modified version of the Laplacian for the 
complete graph, with an extra "marking term" added: 

//= |y>(V| + |M)(M|. (54) 

It is convenient to consider the action of this Hamiltonian in terms of the vectors 
\M) and \M^), where 

As outlined in f^TI, we let a = We can then re- write the Hamiltonian in 

terms of the basis {\M) , |M-'-)}: 



=/ + a^cTz + a -a^ox (58) 

=/ + a(ac7z + - «2ox) (59) 

where dx and dz are the PauU X and Z operators. Note that the identity term in the 
sum simply introduces a global phase, and can be ignored. Note that the operator 

A^aoz^ V 1 - oJ^cTx (60) 
has eigenvalues ±1, and is therefore Hermitian and unitary. Therefore, we can write 

e"'^' =e"''*"' =cos(af)/-/sin(af)A. (61) 

Also note that 

A\S) ={aaz + \J\-a?Ox){a\M)^\J\-a} M^^) (62) 
= \M). (63) 

If we start with the state jS'), we can now calculate the state of the system at time f : 

|p(f)) =cos(af) |5) -/sin(ar)A|5) (64) 
= cos(m) \S) - /sin(af ) \M) . (65) 
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So, at time t, the probability of finding the system in a state corresponding to a 
marked vertex is 



At time f = 0, this is the same as sampHng from the uniform distribution, as we 
would expect. However, at time t = 7r/2a = njl- \J~nJm, we observe a marked 
vertex with probability 1. Therefore, this search algorithm runs in time 0{^JnJm), 
which coincides with the running time of Grover's search algorithm and the related 
discrete time quantum walk algorithm. 

4.2.2 Mixing in Quantum Walks and the Glued Tree Traversal Problem 

While continuous time quantum walks give us a generic quadratic speedup over 
their classical counterparts, there are some graphs on which the quantum walk gives 
exponential speedup. One such example is the "glued trees" of Childs et al. [23 J. In 
this example, the walk takes place on the following graph, obtained by taking two 
identical binary trees of depth d and joining their leaves using a random cycle, as 
illustrated in Figure]?] Beginning at the vertex ENTRANCE, we would like to know 



ENTRANCE 




Fig. 4 A small glued tree graph. 

how long we need to run the algorithm to reach EXIT. It is straightforward to show 
that classically, this will take time exponential in d, the depth of the tree. Intuitively, 
this is because there are so many vertices and edges in the "middle" of the graph, 
that there is small probability of a classical walk "escaping" to the exit. We will 




(66) 



a^cos^(af) + sin^(af). 



(67) 



Algorithms for Quantum Computers 



25 



prove that a continuous time quantum walk can achieve an overlap of Q, {P{d)) with 
the EXIT vertex in time O(gpj), where Q and P are polynomials. 

Let Vs denote the set of vertices at depth s, so that Vq = {entrance} and 
V2d+\ = {exit}. Taking the adjacency matrix to be our Hamiltonian, the operator 
U {t) = e~'"' acts identically on the vertices in Vg for any s. Therefore, the states 



s) = ^= y |v) (68) 




form a convenient basis. We can therefore think of the walk on G as a walk on the 
states {I*) : 0<i<2<i + l}. We also note that, if A is the adjacency matrix of G, 



A\s} 



'V2\s+l) s = 

V2\s-l) s = 2d+l 

V2|s-l)+2|s + l) s = d (69) 

V2\s+l)+2\s-l) s = d+l 

V2\s + 1) + V2\s - 1) otherwise. 



Aside from the exceptions at entrance, exit, and the vertices in the center, this 
walk looks exactly Uke the walk on a line with uniform transition probabilities. 

Continuous time classical random walks will eventually converge to a limiting 
distribution. The time that it takes for the classical random walk to get 'close' to 
this distribution is called the mixing time. More formally, we can take some small 
constant y, and take the mixing time to be the amount of time time it take to come 
within 7 of the stationary distribution, according to some metric. In general, we 
express the mixing time in terms of 1 /y and n, the number of vertices in the graph. 

Since quantum walks are govemed by a unitary operator, we cannot expect the 
same convergent behaviour. However, we can define the hmiting behaviour of quan- 
tum walks by taking an average over time. In order to do this, we define Pr{u, v, 7"). 
If we select f G [0,T] uniformly at random and run the walk for time t, beginning 
at vertex u, then Pr(M, v, T) is the probability that we find the system in state v. For- 
mally, we can write this as 



Pr{u,v,T) = ^J^^\{v\e~'^\u)\'dt 



(70) 



where A is the adjacency matrix of the graph. Now, if we take {|A)} to be the 
set of eigenvectors of A with corresponding eigenvalues {A}, then we can rewrite 
Pr(M,v,r) as follows: 
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Pr(M,v,r)-- 

T Jo 



2 

iXt 



l^e-'^'{v\X){X\u) 



dt (71) 



(v|A) (A |m) (v|A') (A'|m)) t/f (72) 



In particular, we have 



= El(v|A)(A|«)|2 (73) 
X 

+ i 52 (v|A)(A|«)(v|A')(A'|«) (74) 

= £|(v|A)(A|«)p (75) 
X 

+ E (v|A)(A|«)(v|AO(A>) ^ (76) 
x%' i{X-X')T 



limPr(M,v,r)=El(v|A)(A|M)p. (77) 



We will denote this value by Pr(M,v, oo). This is the quantum analogue of the 
limiting distribution for a classical random walk. We would now like to apply 
this to the specific case of the glued tree traversal problem. First, we will lower 
bound Pr(ENTRANCE,EXlT,oo). We will then show that Pr(ENTRANCE, EXIT, T) 
approaches this value rapidly as we increase T, implying that we can traverse the 
glued tree structure efficiently using a quantum walk. 
Define the reflection operator & by 

&\j) = \2d-\-j) (78) 

This operator commutes with the adjacency matrix, and hence the walk operator 
because of the symmetry of the glued trees. This implies that can be diagonalized 
in the eigenbasis of the walk operator e^'^' for any t. What is more, the eigenvalues 
of are ±1. As a result, if |A) is an eigenvalue of e^'^' , then 

(A I entrance) = ± (A I exit) . (79) 

We can apply this to (|77]), yielding 

Pr(ENTRANCE,EXIT,oo) = ^ | (eNTRANCE|A) I'* (80) 

X 

>^^LKentRANCe|A)P (81) 
1 

^2d + 2' 



(82) 
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Now, we need to determine how quickly Pr(ENTRANCE, EXIT, T) approaches Pr(ENTRANCE, EXIT, oo) 
as we increase T: 

|Pr(ENTRANCE,EXIT,r) -Pr(ENTRANCE,EXIT,oo)| (83) 



y (exit|A) (A |entrance) (exit|A') (A'|entrance) 

A/A' i(A,-A)T 

2 l_g-i{l-f~')T 



(84) 



< y |(A|entrance)N(A'|entrance)| -— — (85) 

<^ (86) 

where 5 is the difference between the smallest gap between any two distinct eigen- 
values of A. As a result, we get 

1 2 

Pr(ENTRANCE, EXIT, T) > — - — . (87) 

2d— I To 

Childs et al. l|23l show that 5 is. Q.{\ /d^). Therefore, if we take T of size Oid^), we 
get success probability 0{\/d). Repeating this process, we can achieve an arbitrarily 
high probability of success in time polynomial in d — an exponential speedup over 
the classical random walk. 



4.2.3 AND-OR Tree Evaluation 

AND-OR trees arise naturally when evaluating the value of a two player combi- 
natorial game. We will call the players Pq and Pi. The positions in this game are 
represented by nodes on a tree. The game begins at the root, and players alternate, 
moving from the root towards the leaves of the tree. For simplicity, we assume that 
we are dealing with a binary tree; that is, for each move, the players have exactly 
two moves. While the algorithm can be generalizes to any approximately balanced 
tree, but we consider the binary case for simplicity. We also assume that every game 
lasts some fixed number of turns d, where a turn consists of one player making a 
move. We denote the total number of leaf nodes by n = 2''. We can label the leaf 
nodes according to which player wins if the game reaches each node; they are la- 
beled with a if fo wins, and a 1 if Pi wins. We can then label each node in the 
graph by considering its children. If a node corresponds to Pq's turn, we take the 
AND of the children. For Pi 's move, we take the OR of the children. The value at the 
root node tells us which player has a winning strategy, assuming perfect play. 
Now, since 



K^T){x\,...,Xk) =NAND(NAND(jt:i,...,Jt:|t)) 
OR(jt:i,...,Xi) ==NAND(nAND(jci ),..., NAND(jC|(.)) 



(88) 
(89) 
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we can rewrite the AND-OR tree using only NAND operations. Furthermore, since 

NOT(jt:i) = NAND(jt:i) (90) 

rather than label leaves with a 1, we can insert an extra node below the leaf and 
label it 0. Also, consecutive NAND operations cancel out, and will be removed. This 
is illustrated in figure [5] Note that the top-most node will be omitted from now on, 
and the shaded node will be referred to as the ROOT node. Now, the entire AND-OR 
tree is encoded in the structure of the NAND tree. We will write NAND(y) to denote 
the NAND of the value obtained by evaluating the tree up to a vertex v. The value of 
the tree is therefore NAND(root). 

N 



A N 




N N 

(b (3) 

Fig. 5 An AND-OR tree and the equivalent NAND tree, with example values. Note that the top-most 
node will be omitted from now on, and the shaded node in the NAND tree will be referred to as the 
ROOT node. 



The idea of the algorithm is to use the adjacency matrix of the NAND tree as the 
Hamiltonian for a continuous time quantum walk. We will show that the eigenspec- 
trum of this walk operator is related to the value of NAND(root). This idea first 
appeared in a paper of Farhi, Goldman and Gutmann 1*291, and was later refined by 
Ambainis, Childs and Reichardt |9]. We begin with the following lemma: 

Lemma 1. Let |A) be an eigenvector of A with value A. Then, ifv is a node in the 
NAND tree with parent node p and children C, 

{p\X)^~l{v\X) + Y^{c\l) (91) 

cec 

Therefore, if A has an eigenvector with value 0, then 
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{p\A^) = Y^{c\Xo) (92) 

cec 

Using this fact and some inductive arguments, Ambainis, Childs and Reichardt ||9] 
prove the following theorem: 

Theorem 4. /f NAND(root) = 0, then there exists an eigenvector \Xq) of A with 
eigenvalue such that |(root|Ao)| > Otherwise, //nand(root) = 1, then for 

any eigenvector \X) of A with {X\A \X) < we have (root|Ao) = 0. 

This result immediately leads to an algorithm. We perform phase estimation with 
precision 0(1/ y/n) on the quantum walk, beginning in the state |root). If NAND(root) 
0, get a phase of with probability > j. If NAND(root) = 1, we will never get a 
phase of 0. This gives us a running time of 0{y/n). While we have restricted our 
attention to binary trees here, this result can be generalized to any m-ary tree. 

It is worth noting that unstructured search is equivalent to taking the OR of n 
variables. This is a AND-OR tree of depth 1, and n leaves. The 0{^/n) running time of 
Grover's algorithm corresponds to the running time for the quantum walk algorithm. 

Classically, the running time depends not just on the number of leaves, but on the 
structure of the tree. If it is a balanced m-ary tree of depth d, then the running time 
is 

01 I - 1 + + , (93) 




In fact, the quantum speedup is maximal when we have an n-ary tree of depth 1. 
This is just unstructured search, and requires Q{n) time classically. 

Reichardt and Spalek 1551 generalize the AND-OR tree problem to the evalu- 
ation of a broader class of logical formulas. Their approach uses span programs. 
A span program P consists of a set of target vector f, and a set of input vectors 
{vj, v[, ...,vj),v^,} corresponding to logical literals {xi^xi^...,x„,x^}. The program 
corresponds to a boolean function fp : {0, 1}" {0, 1} such that, for a G {0, 1}", 
/(a) = 1 if and only if 

j=i 

Reichardt and Spalek outline the connection between span programs and the evalu- 
ation of logical formulas. They show that finding CJ € /^'(l) is equivalent to find- 
ing a zero eigenvector for a graph Gp corresponding to the span program P. In this 
sense, the span program approach is similar to the quantum walk approach of Childs 
et al. — both methods evaluate a formula by finding a zero eigenvector of a corre- 
sponding graph. 
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5 Tensor Networks and Their Applications 

A tensor network consists of an underlying graph G, with an algebraic object called 
a tensor assigned to each vertex of G. The value of the tensor network is calculated 
by performing a series of operations on the associated tensors. The nature of these 
operations is dictated by the structure of G. At their simplest, tensor networks cap- 
ture basic algebraic operations, such as matrix multiplication and the scalar product 
of vectors. However, their underlying graph structure makes them powerful tools for 
describing combinatorial problems as well. We will explore two such examples — 
the Tutte polynomial of a planar graph, and the partition function of a statistical 
mechanical model defined on a graph. As a result, the approximation algorithm that 
we will describe below for the value of a tensor network is implicitly an algorithm 
for approximating the Tutte polynomial, as well as the partition function for these 
statistical mechanics models. We begin by defining the notion of a tensor. We then 
outline how these tensors and tensor operations are associated with an underlying 
graph structure. For a more detailed account of this algorithm, the reader is referred 
to ifTOl . Finally, we will describe the quantum approxiamtion algorithm for the value 
of a tensor network, as well as the appUcations mentioned above. 

5.0.4 Tensors: Basic Definitions 

Tensors are formally defined as follows: 

Definition 3. A tensor M of rank m and dimension ^ is an element of C'" . Its entries 
are denoted by Mj^j^^,,, , where < < ^ — 1 for all . 

Based on this definition, a vector is simply a tensor of rank 1, while a square matrix 
is a tensor of rank 2. We will now define several operations on tensors, which will 
generalize many familiar operations from linear algebra. 

Definition 4. Let M and be two tensors of dimension q and rank m and n respec- 
tively. Then, their product, denoted M (g) A^, is a rank m + n tensor with entries 



This operation is simply the familiar tensor product. While the way that the entries 
are indexed is different, the resulting entries are the same. 

Definition 5. Let M be a tensor of rank m and dimension q. Now, take a and b with 
\ <a <b <m. The contraction of M with respect to a and is a rank m — 2 tensor 
A' defined as foUows: 



{M®N) 



■kn -^Jl j,„ -^ki k„- 



(94) 



9-1 



71 •■■■Jci-ukJa^l,...jt-l,k.jl,+ i,---Jm ■ 



(95) 
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One way of describing this operation is that each entry in the contracted tensor is 
given by summing along the "diagonal" defined by a and b. This operation general- 
izes the partial trace of a density operator. The density operator of two qubit system 
can be thought of as a rank 4 tensor of dimension 2. Tracing out the second qubit is 
then just taking a contraction with respect to 3 and 4. It is also useful to consider the 
combination of these two operations: 

Definition 6. If M and A' are two tensors of dimension q and rank m and n, then for 
a and b with I <a<m and 1 < < n, the contraction of M and A'^ is the result of 
contracting the product M^N with respect to a and m + b. 

We now have the tools to describe a number of familiar operations in terms of tensor 
operations. For example, the inner product of two vectors can be expressed as the 
contraction of two rank 1 tensors. Matrix multiplication is just the contraction of 2 
rank 2 tensors M and N with respect to the second index of M and the first index of 
A'^. Finally, if we take a Hilbert space H = C^, then we can identify a tensor M of 
dimension q and rank m with a linear operator M*'' : //®' //®* where s + t = m: 

M'''= £ Mj^„„jJji)(g)...(g)\js){js+i\(»...(^{M\- (96) 

This correspondence with linear operators is essential to understanding tensor net- 
works and their evaluation. 



5.1 The Tensor Network Representation 

A tensor network T{G,M) consists of a graph G = {V,E) and a set of tensors M = 
{M[v] : V e V}. A tensor is assigned to each vertex, and the structure of the graph 
G encodes the operations to be performed on the tensors. We say that the value of 
a tensor network is the tensor that results from applying the operations encoded by 
G to the set of tensors M. When the context is clear, we will let T{G,M) denote 
the value of the network. In addition to the typical features of a graph, we allow 
G to have free edges — edges that are incident with a vertex on one end, but are 
unattached at the other. For simplicity, we will assume that all of the tensors in M 
have the same dimension q. We also require that deg{v), the degree of vertex v, is 
equal to the rank of the associated tensor M[v] . If G consists of a single vertex v with 
m free edges, then T{G,M) represents a single tensor of rank m. Each index of M 
is associated with a particular edge. It will often be convenient to refer to the edge 
associated with an index a as gj,. We now define a set of rules that will allow us to 
construct a tensor network corresponding to any sequence of tensor operations: 

• If M and A'^ are the value of two tensor networks T(G,M) and T{H,N), then 
taking the product M (E)N is equivalent to taking the disjoint union of the two 
tensor networks, T{GUH,MUN) 

• If M is the value of a tensor network T{G,M), then contracting M with respect 
to a and b is equivalent to joining the free edges Ca and Ch in G. 
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• As a result, taking the contraction of M and with respect to a and b corresponds 
to joining the associated edge from T{G,M) and ei, from T{H,N) 

Some examples are illustrated in figure |6] Applying these simple operations itera- 
tively allows us to construct a network corresponding to any series of products and 
contractions of tensors. Note that the number of free edges in T{G,M) is always 
equal to the rank of the corresponding tensor M. We will be particularly interested 
in tensor networks that have no free edges, and whose value is therefore a scalar 




Fig. 6 The network representation of a tensor M of rank 4, and the contraction of two rank 4 
tensors, M and N. 



We can now consider the tensor network interpretation of a linear operator M^'' 
defined in (96 1. M' ' is associated with a rank m = s + t tensor M. An equivalent 
tensor network could consist of a single vertex with m free edges (ei, ...,em)- The 
operator M^ ' acts on elements of (C^)®', which are rank t tensors, and can therefore 
be represented a single vertex with f free edges (e'j . ...,ej). The action of M* ' on an 
element A^e (C*)®' is therefore represented by connecting e[ to gj^^- for A; = l,...,f. 
Figure|7]illustrates the operator M^'^ acting on a rank 2 tensor. Note that the resulting 
network has two free edges, corresponding to the rank of the resulting tensor. 




We now return to the tensor networks we will be most interested in — those with 
no free edges. Let us first consider the example of an inner product of two rank 1 
tensors. This corresponds to a graph consisting of two vertices u and v joined by a 
single edge. The value of the tensor is then 
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T{G,M)^t(Mu)j-{M[v])j (97) 

;=o 

That is, it is a sum of q terms, each of which corresponds to the assignment of an 
element of {0, ■■■,q— 1} to the edge (m, v). We can refer to this assignment as an 
g'-edge colouring of the graph. In the same way, the value of a more complex tensor 
network is given by a sum whose terms correspond to ^-edge colourings of G. Given 
an colouring of G, let M[y]'' = M[y],j where ii ,...,i,n are the values assigned by 
7 to the edges incident at v. That is, a ^-edge colouring specifies a particular entry of 
each tensor in the network. Then, we can rewrite the value of r(G,M) as follows; 




where 7 runs over all ^-edge colourings of G. Thinking of the value of a tensor 
network as a sum over all ^-edge colourings of G will prove useful when we consider 
applications of tensor networks to statistical mechanics models. 



5.2 Evaluating Tensor Networks: the Algorithm 

In this section, we will first show how to interpret a tensor network as a series of 
linear operators. We will then show how to apply these operators using a quantum 
circuit. Finally, we will see how to approximate the value of the tensor network 
using the Hadamard test. 

5.2.1 Overview 

In order to evaluate a tensor network r(G,M), we first give it some structure, by 
imposing an ordering on the vertices of G, 

y(G) = {vi,...,v„}. 

We further define the sets Sj = {vi, V;}. This gives us a natural way to depict the 
tensor network, which is shown in figure [8] Our evaluation algorithm will "move" 
from left to right, applying linear operators corresponding to the tensors M^.■. We 
can think of the edges in analogy to the "wires" of a quantum circuit diagram. To 
make this more precise, let v; be a vertex of degree d. Let f be the number of vertices 
connecting v, to 1 and s be the number of edges connecting v, to V (G) \ Si. Then, 
the operator that is applied at v, is My' . Letting j be the number of edges from 
to y (G) \ Si, we apply the identity operator Ij on the j indices that correspond to 
edges not incident at v, . Combining these, we get an operator 
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Fig. 8 Part of the graph G, with an ordering on V{G). 



Ni^M'^.f(g)Ij. (99) 

Note that, taking the product of these operators, H/Li gives the value of r(G,M). 
However, there are a few significant details remaining. Most notably, the operators 
A^,- are not, in general, unitary. In fact, they may not even correspond to square ma- 
trices. We will now outhne a method for converting the A^, into equivalent unitary 
operators. 



5.2.2 Creating Unitary Operators 

Our first task is to convert the operators A^, into "square" operators — that is, op- 
erators whose domain and range have the same dimension. Referring to (99 1, the 
identity is already square, so we need only modify My' . In order to do this, we 
will add some extra qubits in the |0) state. We consider three cases: 

1 . If 5 = f , we define Mtf = M^:' 

2. If 5 > f , then we add s — t extra qubits in the |0) state, and define Myf such that 

A<(|v/)(»|0)®(-^-')) =M;;'|r). (100) 

For completeness, we say that if the last s — t qubits are not in the |0) state, M^f 
takes them to the vector. 

3. If i < f , then we define 

M^'\xif) = {M:f\xif))^\Of^'-'\ (101) 

Finally, we define 

Ni=Mt,f(g)lj, (102) 

which is a square operator Now, to derive corresponding unitary operators, we make 
use of the following lemma: 
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Lemma 2. Let M be a linear map A : H^'' — > H®\ where H is a Hibert space H = 
C^. Furthermore, let G — be a spanned { |0) , 1 1 ) }. Then, there exists a unitary 
operator Um ■ {H^'' ® G) ^ (//®' ® G) such that 



UMi\v)(^\o)) 



1 



M 



.M\\(f)(g)\0) + \(j))(g)\l) 



(103) 



Um can be implemented on a quantum computer in poly{q') time. 

A proof can be found in 1 10] as well as [4]. Applying this lemma, we can create n 
unitary operators Ufj. for I < j <n, and define 



7=1 



It is easily verified that 



T{G,M) 



(104) 



(105) 



where r is dependent on the number of vertices n as well as the structure of G and 
the ordering of the vertices vi , v„. 



5.2.3 The Hadamard Test 

In order to approximate (0|®'^J/ jO)®*^, most authors suggest the Hadamard test — 
a well-known method for approximating the weighted trace of a unitary operator. 
First, we add an ancillary qubit that will act as a control register We then apply the 
circuit outlined below, and measure the ancillary qubit in the computational basis. 



|0) 
\¥) 



H 



H 



U 



Fig. 9 The quantum circuit for the Hadamard test. 



It is not difficult to show that we measure 



|0) with probability 2 



1 1) with probability -{l - Re {y/\U \y/)) . 
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So, if we assign a random variable X such that X = I when we measure |0) and 
X — — 1 when we measure |1), thenZ has expected value Re (y/j U\\l/). So, in order 
to approximate Re (\//| U\\l/) to a precision of e with constant probability p > 1/2, 
we require 0{e^^) repetitions of this protocol. In order to calculate the imaginary 
portion, we apply the gate 

'1 



R = 







to the ancillary qubit, right after the first Hadamard gate. It is important to note 
that the Hadamard test gives an additive approximation of (0|®'^t/ |0)®'^. Additive 
approximations will be examined in section [5".2.5| 



5.2.4 Approximating (0|®'^t/ 10)®' Using Amplitude Estimation 

In order to estimate (0|®'^t/ 10)®'^, most of the literature applies the Hadamard test. 
However, it is worth noting that we can improve the running time by using the 
process of amplitude estimation, as outlined in section |3] Using the notation from 
section |3] we have Uf — Uq and A ~ U, the unitary corresponding to the tensor 
network r(G,M). We begin in the state f/|0)®', and our search iterate is 

Q = -UUoU-^Uo. (106) 

In order to approximate (0|®'^f/ 10)®'^ to a precision of e, our running time is in 
0(1 /e), a quadratic improvement over the Hadamard test. 



5.2.5 Additive Approximation 

Let / : 5 — > C be a function that we would like to evaluate. Then, an additive ap- 
proximation A with approximation scale 4 : 5 — > R is an algorithm that, on input 
X G S and parameter e > , outputs A{x) such that 

Pr(|A(x)-/(x)| >ezi(ji:)) <c (107) 

for some constant c with < c < 1/2. If 4 (x) is 0{f{x)), then A is afully polynomial 
randomized approximation scheme, or FPRAS. 

We will now determine the approximation scale for the algorithm outlined above. 
Using amplitude estimation, we estimate 

{Oru\Of''= ^^!^'y„ (108) 

n/||M;/|| 

to within e with time requirement in 0{l/e). However, the quantity we actually 
want to evaluate is T{G,M), so we must multiply by Ylj \ Therefore, our 

algorithm has approximation scale 
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4(G,M)=niK'|| (109) 

i 

We apply lemma|2]to each vertex in G, and require 0{l/e) repetitions of the algo- 
rithm to approximate r(G,M) with the desired accuracy using amplitude estima- 
tion. This gives an overall running time 

o(poly(^^(«)).|y(G)|-e) (110) 

where D{G) is the maximum degree of any vertex in G. The algorithm is, therefore, 
an additive approximation of r(G,M). 



5.3 Approximating the Tutte Polynomial for Planar Graphs Using 
Tensor Networks 



In 2000, Kitaev, Freedman, Larsen and Wang 11321 [311 demonstrated an efficient 
quantum simulation for topological quantum field theories. In doing so, they im- 
plied that the Jones polynomial can be efficiently approximated at e^™l^ . This sug- 
gests that efficient quantum approximation algorithms might exist for a wider range 
of knot invariants and values. Aharonov, Jones and Landau developed such a Jones 
polynomial for any complex value [5 1. Yard and Wocjan's developed a related algo- 
rithm for approximating the HOMFLYPT polynomial of a braid closure |64|. Both 
of these knot invariants are special cases of the Tutte polynomial. While the tensor 



network algorithm in section 5.3.2 follows directly from a later paper of Aharonov 
et al. H, it owes a good deal to these earlier results for knot invariants. 

We will give a definition of the Tutte polynomial, as well as an overview of its 
relationship to tensor networks and the resulting approximation algorithm. 



5.3.1 The l\itte Polynomial 

The multi-variate Tutte Polynomial is defined for a graph G ~ (V,/?) with edge 
weights w = {wg} and variable q as follows: 

zg(?,w)= i^/t^'n^^ (111) 

where k{A) denotes the number of connected components in the graph induced by 
A. The power of the Tutte polynomial arises from the fact that it captures nearly all 
functions on graphs defined by a skein relation. A skein relation is of the form 



!{G)=x-f{Gle)^yf{G\e) 



(112) 
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where Gje is created by contracting an edge e and G\e is created by deleting e. 
Oxley and Welsh |62J show that, with a few additional restrictions on /, if / is 
defined by a skein relation, then computing / can be reduced to computing Zq. It 
turns out that many functions can be defined in terms of a skein relation, including 

1. The partition functions of the Ising and Potts models from statistical physics. 

2. The chromatic and flow polynomials of a graph G. 

3. The Jones Polynomial of an alternating link. 

The exact evaluation of the Tutte polynomial, even when restricted to planar graphs, 
turns out to be #P-hard for all but a handful of values of q and w. An exact algo- 
rithm is therefore believed to be impossible for a quantum computer. Indeed, even 
a polynomial time FPRAS seems very unlikely for the Tutte polynomial of a gen- 
eral graph and any parameters q and w. The interesting question seems to be which 
graphs and parameters admit an efficient and accurate approximation. By describ- 
ing the Tutte polynomial as the evaluation of a Tensor network, we immediately 
produce an additive approximation algorithm for the Tutte polynomial. However, 
we do not have a complete characterization of the graphs and parameters for which 
this approximation is non-trivial. 

5.3.2 The Tutte Polynomial as a Tensor Network 

Given a planar graph G, and an embedding of G in the plane, we define the medial 
graph Lq. The vertices of Lq are placed on the edges of G. Two vertices of Lq are 
joined by an edge if they are adjacent on the boundary of a face of G. If G contains 
vertices of degree 2, then Lq will have multiple edges between some pair of vertices. 
Also note that Lq is a regular graph with valency 4. An example of a graph and its 
associated medial graph is depicted in figure [TO] 

In order to describe the Tutte polynomial in terms of a tensor network, we make 
use of the generalized Temperley-Lieb algebra, as defined in |4|. The basis elements 
of the Temperley-Lieb algebra GTL{d) can be thought of as diagrams in which m 
upper pegs are joined to n lower pegs by a series of strands. The diagram must 
contain no crossings or loops. See figure [TT|for an example of such an element. Two 
basis elements are considered equivalent if they are isotopic to each other or if one 
can be obtained from the other by 'padding' it on the right with a series of vertical 
strands. The algebra consists of all complex weighted sums of these basis elements. 
Given two elements of the algebra Ti and 72, we can take their product T2 ■ Ti by 
simply placing T2 on top of Ti and joining the strands. Note that if the number of 
strands do not match, we can simply pad the appropriate element with some number 
of vertical strands. As a consequence, the identity element consists of any number of 
vertical strands. In composing two elements in this way, we may create a loop. Let 
us say that Ti ■ T2 contains one loop. Then, if Tj is the element created by removing 
the loop, we define T\ T2 = dT^, where d is the complex parameter of GTL{d). 

We would also like this algebra to accommodate drawings with crossings. Let 
Ti be such a diagram, and T2 and be the diagrams resulting from 'opening' the 
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Fig. 10 A graph G and the resulting medial graph Lq- 



9 9 




o o o o 



Fig. 11 An element of the Temperley-Lieb algebra with 6 upper pegs and 4 lower pegs. 



crossing in the two ways indicated in figure 12 Then, we define T\ = aT2+bTj, for 
appropriately defined a and b. 

If we think of Lq as the projection of a knot onto the plane, where the vertices 
of Lc are the crossings of the knot, we see that Lq can be expressed in terms of 
the generalized Temperly-Lieb algebra. We would like to take advantage of this in 
order to map Lq to a series of tensors that will allow us to approximate the Tutte 
polynomial of G. Aharonov et. al. define such a representation p of the Temperley 
Lieb algebra. If T e GTL(d) is a basis element with m lower pegs and n upper pegs. 
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= a 





Fig. 12 An element that contains a crossing is equated to a weighted sum of the elements obtained 
by 'opening' the crossing. 



then p is a linear operator such that 



H 



(113) 



where H = for some k (which depends on the particular embedding of G) such 
that 

1. p preserves multiplicative structure. That is, if T\ has m upper pegs and T2 has m 
lower pegs, then 

piT2-Ti)^p{T2)-p{Ti). (114) 



2. p is hnear. That is. 



p{aTi +m) = ap{Tx) + ^p{T2). 



(115) 



For our purposes, k can be upper bounded by the number of edges [^(Lg)! 



2 [^(G)!, and corresponds to the value r in equation ( 105 1. To represent Lq, each 



crossing (vertex) of Lq is associated with a weighted sum of two basis elements, 
and therefore is represented by a weighted sum of the corresponding linear opera- 
tors, p{T\) = ap{T2) +bp{T^). The minima and maxima of Lq are also basis ele- 
ments of GTL{d), and can be represented as well. Finally, since Lq is a closed loop, 
and has no 'loose ends', p (Lg) must be a scalar multiple of the identity operator on 
H — C'^. This relates well to our notion that the value of a tensor network with no 
loose ends is just a scalar 

We would like this scalar to be the Tutte polynomial of the graph G. In order to do 
this, we need to choose the correct values for a, b and d. Aharonov et. al. show how 
to choose these values so that the scalar is a graph invariant called the Kauffman 
bracket, from which we can calculate the Tutte polynomial for planar graphs. So, 
we can construct a tensor network whose value is the Kauffman bracket of Lq, and 
thus gives us the Tutte polynomial of G by the following procudure: 



1. Construct the medial graph Lq and embed it in the plane. 
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2. Let L'q be constructed by adding a vertex at each local minimum and maximum 
of Lq. We need these vertices because we need to assign the linear operator 
associated with each minimum/maximum with a vertex in the tensor network. 

3. Each vertex v of L'q has a Temperley-Lieb element Ty associated with it. For some 
vertices, this is a crossing; for others, it is a local minimum or maximum. Assign 
the tensor M[v] = p{Ty,) to the vertex v. 

The tensor network we are interested in is then r(LQ,M), where M = {M[v\ : v E 
y (L'g)}. Approximating the value of this tensor network gives us an approximation 
of the Kauffman bracket of Lq, and therefore of the Tutte polynomial of G. Further- 
more, we know that L'q has maximum degree 4. Finally, we will assume that giving 
a running time of 

0(poly(^4).|£(G) !•£-') (116) 

In this case, q depends on the particular embedding f Lq, but can be upper bounded 
by \E\. For more details on the representation p and quantum approximations of the 
Tutte polynomial as well as the related Jones polynomial, the reader is referred to 
the work of Aharonov et al., in particular (|3l and Q. 



5.4 Tensor Networks and Statistical Mechanics Models 

Many models from statistical physics describe how simple local interactions be- 
tween microscopic particles give rise to macroscopic behaviour See [11 J for an 
excellent introduction to the combinatorial aspects of these models. The models we 
are concerned with here are described by a weighted graph G = {V,E).ln this graph, 
the vertices represent particles, while edges represent an interaction between parti- 
cles. A configuration (7 of a g'-state model is an assignment of a value from the set 
{0, — 1} to each vertex of G. We denote the value assigned to v by a,,. For each 
edge e — {u,v), we define a local Hamiltonian h^{au, ffv) G C. The Hamiltonian for 
the entire system is then given taking the sum: 

Hio)^ h,{a,„a„) (117) 

e=iu.v) 

The sum runs over all the edges of G. The partition function is then defined as 

ZQip)=Y,e-P"^'''> (118) 
a 

where (5 = 1/kT is referred to as the inverse temperature and k is Boltzmann's con- 
stant. The partition function is critical to understanding the behaviour of the system. 
Firstly, the partition function allows us to determine the probability of finding the 
system in a particular configuration, given an inverse temperature j3 : 

PrM^^ (1.9) 
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This probability distribution is known as the Boltmann distribution. Calculating the 
partition function also allows us to derive other properties of the system, such as 
entropy and free energy. An excellent discussion of the partition function from a 
combinatorial perspective can be found in |62|. 

We will now construct a tensor network whose value is the partition function Zq. 
Given the graph G, we define the vertices of G' — (V',/?') as follows: 

y' = yu{v, :ee£} (120) 

We are simply adding a vertex for each edge e of G, and identifying it by v^. We 
then define the edge set of G': 

£' = {(x, vv), (v„y) : (x,3') = e G £} (121) 

So, the end product G' resembles the original graph; we have simply added vertices 
in the middle of each edge of the original graph. The tensors that will be identified 
with each vertex will be defined separately for the vertex set V of the original graph 
and the set Ye = {ve} of vertices we added to define G'. In each case, they will be 
dimension q tensors for a ^-state model. For v GV, M[v] is an 'identity' operator; 
that is, it takes on the value 1 when all indices are equal, and otherwise: 

I otherwise 

The vertices in Vg encode the actual interactions between neighbouring particles. 
The tensor associated with is 

{M,.Xr=e-P"'^''''> (123) 

Now, let us consider the value of the tensor network in terms of <7-edge colourings 
of G: 

r(G',M) = £( n^M'l (124) 

r \vev' J 

where 7 runs over all ^-edge colourings of G'. Based on the definitions of the tensors 
M[v] for V G y, we see that IIvgv'^M'^ is non-zero only when the edges incident at 
each vertex are all coloured identically. This restriction ensures that each non-zero 



term of the sum ( 124 1 corresponds to a configuration a of the g'-state model. That is, 
a configuration a corresponds to a ^-edge colouring 7 of G' where each 7(e) = Cy 
whenever e is incident with v e V. This gives us the following equality: 
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r \vev' / 

O" e={u,v) 

=Zg(/3) 

We now consider the time required to approximate the value of this tensor network. 
Recall that the running time is given by 

O (poly(^^(^'') • |y(G')| • (125) 

First, we observe that, if we assume that G is connected, then |y(G')| is Od/sl). 
We have not placed any restrictions on the maximum degree D{G'). The vertices 
of G' of high degree must be in V, since all vertices in Ve have degree 2. Now, 
the tensors assigned to vertices of V are just identity operators; they ensure that the 
terms of the sum from ( |124[ ) are non-zero only when all the edges incident at v are 
coloured identically. As a result, we can replace each vertex v eV with deg(v) > 3 



by (deg(v) — 2) vertices, each of degree 3. See figure 13 for an example. The tensor 




assigned to each of these new vertices is the identity operator of rank 3. It is not 
difficult to show that this replacement does not affect the value of the tensor network. 
It does, however, affect the number of vertices in the graph, adding a multiplicative 
factor to the running time, which can now be written 

0(poly(^).D(G)-|£|-e-i) (126) 

For more detail on the approximation scale of this algorithm, the reader is referred 

tonni. 

Arad and Landau also discuss a useful restriction of ^-state models called dif- 
ference models. In these models, the local Hamiltonians depend only on the dif- 
ference between the states of the vertices incident with e. That is, h(.{Ou, (J,,) is re- 
placed by he{\Ou — CJvl), where |(J„ — (7,,| is calculated modulo q. Difference models 



44 



Jamie Smith and Michele Mosca 



include the well-known Potts, Ising and Clock models. Arad and Landau show that 
the approximation scale can be improved when we restrict our attention to difference 
models. 

Van den Nest et al. 1251 l40l show that the partition function for difference models 
can be described as the overlap between two quantum states. 



Van den Nest uses the state | y/c) to encode the structure of the graph, while each 
state \ae) encodes the strength of the local interaction at e. The actual computation 
of this overlap is best described as a tensor network. While this description does 
not yield a computational speedup over the direct method described above, it is 
an instructive way to deconstruct the problem. In the case of planar graphs, it also 
relates the partition function of a graph G and its planar dual. 

Geraci and Lidar ||34l |35l show that the Potts partition may be efficiently and 
exactly evaluated for a class of graphs related to irreducible cyclic cocycle codes. 
While their algorithm does not employ tensor networks, it is interesting to consider 
the implications of their results in the context of tensor networks. Is there a class of 
tensor networks whose value can be efficiently and exactly calculated using similar 
techniques? 



6 Conclusion 

In this chapter, we have reviewed quantum algorithms of several types: those based 
on the quantum Fourier transform, amplitude amplification, quantum walks, and 
evaluating tensor networks. This is by no means a complete survey of quantum al- 
gorithms; most notably absent are algorithms for simulating quantum systems. This 
is a particularly natural application of quantum computers, and these algorithms 
often achieve an exponential speedup over their classical counterparts. 

Algorithms based on the quantum Fourier transform, as reviewed in section |2] 
include some of the earliest quantum algorithms that give a demonstrable speedup 
over classical algorithms, such as the algorithms for Deutsch's problem and Simon's 
problem. Many problems in this area can be described as hidden subgroup problems. 
Despite the fact that problems of this type have been well-studied, many questions 
remain open. For example, there is no efficient quantum algorithm known for most 
examples of non-Abelian groups. In particular, the quantum complexity of the graph 
isomorphism problem remains unknown. 

Similarly, the family of algorithms based on amplitude amplification and estima- 
tion have a relatively long history. While Grover's original searching algorithm is 
the best-known example, this approach has been greatly generalized. For example, 
in ifTBJ it is shown how the same principles applied by Grover to searching can be 




(127) 
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used to perform amplitude estimation and quantum counting. Amplitude amplifica- 
tion and estimation have become ubiquitous tools on quantum information process- 
ing, and are often employed as critical subroutines in other quantum algorithms. 

Quantum walks provide an interesting analogue to classical random walks. While 
they are inspired by classical walks, quantum walks have many properties that set 
them apart. Some applications of quantum walks are natural and somewhat unsur- 
prising; using a walk to search for a marked element is an intuitive idea. Others, such 
as evaluating AND-OR trees are much more surprising. Quantum walks remain an 
active area of research, with many open questions. For example, the relative compu- 
tational capabilities of discrete and continuous time quantum walks is still not fully 
understood. 

The approximation algorithm for tensor networks as described in this chapter 
ifTOl is relatively new. However, it is inspired by the earlier work of Aharonov et 
al. ||3, as well as Yard and Wocjan |64|. The tensor network framework allows 
us to capture a wide range of problems, particularly problems of a combinatorial 
nature. Indeed, many #P-hard problems, such as evaluating the Tutte polynomial, 
can be captured as tensor networks. The difficulty arises from the additive nature 
of the approximation. One of the most important questions in this area is when this 
approximation is useful, and when the size of the approximation window renders 
the approximation trivial. 

Weve also briefly mentioned numerous other families of new quantum algorithms 
that we did not detail in this review article, and many others unfortunately go un- 
mentioned. We have tried to balance an overview of some of the classic techniques 
with a more in depth treatment of some of the more recent and novel approaches to 
finding new quantum algorithms. 

We expect and hope to see many more exciting developments in the upcoming 
years: novel applications of existing tools and techniques, new tools and techniques 
in the current paradigms, as well as new algorithmic paradigms. 
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